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ABSTRACT 

Four experiments investigated college students' 
judgments of inter-event contingency. Subjects were asked to judge 
the effect of a discrete response (tapping a wire) on the occurrence 
of a brief oVtcome (a radio's buzzing). Pairings of the possible 
events-state combinations were presented in a summary table, an 
unbroken time line, or'a broken time line format. Subjects judged the 
extent to vhich the response v caused the outcome or prevented it from 
occurring. Across all methods of information presentation, judgments 
were a positive function of response-outcome contingency and outcome 
probability. In the unbroken time line condition, judgments of 
negative response-outcome contingencies were less extreme than 
judgments of equivalent positive contingencies. Judgments of positive 
and negative relationships were generally symmetrical in the summary 
table condition. Summary table judgments were less influenced by the 
overall probability of outcome occurrence. These judgment differences 
among format conditions suggest that, depending on the method of 
information presentation, subjects differently partition event 
sequences into discrete event pairings. (Author/MNS) 
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* abstract 
A, series of four experiments investigated college students' judgments of 
interevent contingency* Subjects were asked to judge the effect of a discrete 
response (tapping a wire) on the\>ccurrence of & brief outcome (a radio's 
buzzing). Pairings of the possible event-state combinations (response-out- 

4 

come, response-no outcome, no response~outcom£ * no response-no outcome) were 
presented in a summary table (Experiments 2 and 4), in .an unbroken time line 
(Experiments 1, 2, and 4), or in a broken time line' format (Experiment 3). 
Subjects judged the extent to which the response caused the outcome or ore* 
vented it from occurring. Across all methods of information presentation, 
judgments were a positive function of response-outcome contingency arid outcome 
probability. In the unbroken time line condition, judgments of negative 

response-outcome contingencies wereT/ess extreme than judgments of equivalent 

v. 

positive contingencies. This asymmetry was smaller in the broken time line 
condition and in those conditions where subjects were encouraged to segment an 
unbroken time line into discrete response-outcome units. Finally, judgments 
of positive and negative relationships were generally" symmetrical in the 
summary table condition. Relative to the two time line portrayals, summary 
table judgments were also less influenced by the overall probability of out- 
come occurrence. These judgment differences among format conditions suggest 
that, depending on the method of information presentation, su'jects differently 
partition event sequences into discrete event pairings* The segmenting of 
continuous ev&nt streams day be an important factor in the accuracy of every- 
day judgments of interevent contingency, * 



Children's Judgments about Covariation between Events: 
A Series of Training Studies. Appendix E. 
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\ And now remains, 

That we finu out the cause ol this effect, 
Or rather say the cause of this defect, 
> For this effsct defective comes by cause. 

- W* Shakespeare;* Hamlet* XI, ii 

Students of behavior, both before and after Shakespeare* JiaVe been Interested 

* in causal perception. Most noteworthy was D. Hume (1739) who proposed a set 

of conditions which were conducive to cause-effect impressions. Hume's 

insights into the psychology of causation have helped to shape the direction 

- - X 
of subsequent research and theory in the area. 

Also important have been discussions of causal perception from compara^ 

tive and developmental perspectives. C> L. Morgan (1893, 1894) concluded on 

the basis of extremely limited evidence that Wnan adults, but not children 

and animals, can perceive ti\e relationship between events. More systematic 

data led Inhelder and Piaget \(1958) to propose .a ^tagewise unfolding of the 

human 1 s conception of interevent correlation or contingency as the individual 

develops from child to adult. 

j Subsequent investigations into the perception of intereVent relations 

have not yielded evidence that is consistently favorable to the developmental 

and evolutionary speculations of Morgan and of Inhelder and Piaget. Nor is 

the evidence particularly supportive of modern theories, which posit a 

virtual identity between humans' and animals' perceptions and the actual 

interevent contingencies that prevail in their environments (e.g., Heider, 

1958; Kelley, 1967; Mackintosh, 1974; Rescorla,* 1978). 

In the bfteic human judgment paradigm, subjects are giv*m information 

e 

about the frequency of pairimjs of alternative states (e.g., presence and 
absence) of two events (e*g.., plant food and plant health); they can then be 
asked to judge the direction and magnitude ot the relationship between the 
events. In many of these experiments, adults do not accurately jutfge the 
CD 9 r >- correlation between two binar/ variables (see Crocker, 1981 for a review). 
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Despite these negative results* other work has been more successful in 
^showing that adults can accurately judge inter^vent relations under some 
circumstances (e.g., Allan & Jenkins, 1980; Alloy & Abramson, 1979; Seggie, 
1975; Seggie & Endersby, 197&^ Shaklee & Tucker, 1980). , Nevertheless, many 
factors have been suggested over the past 20 years which may contribute to 



distortions in the perception of correlation. 



< 

Investigators havr found that the accuracy of correlational judgments 
depends on the sign of (he relationship being judged. In p^gticular, Erlick 
and Mills (19&7) found that subjects judged negative correlations as closer to 
zero than, positive correlations of equal magnitudes. Also common is the 

result that subjects find contingencies of zero to be especially difficult to 

* * 
identify. For example, Seggie (1975) reported that subjects were accurate in 

their judgment s of^contingantu'relat ionships , Jrot vera eTrDT=prOTeTn Judging 

noncontingent relationships (also see Allan, 1980; Allan & Jenkins, 1980). 

Alloy and Abramson (19^9) replicated this pattern of differential accuracy in 

nondepressed subjects, but found that depressed adults judged noncontingent 

problems closer to zero than did nondepressed subjects. * % 

One must, however, be cautious in interpreting the effects of relation-^ 

ship direction; subjects may approach the stimuli in_ question with strong 

expectations about ^he nature of the relationship that will hold. In Seggie's 

1975 study, for example, subjects juJge^whether or not hospitalizing a victim 

of a tropical* disease would improve the chances of recovery. Erlick and 

Mills 1 (1967) subjects judged the relationship between the quantity of a 

particular food a person ate and whether the person felt better or worse, 

People who believe in the merits of medical science or hearty eating would be 

likely to ebtpect each to improve general well being- This expectation could 

produce a bias to report relationships as positive, resulting in errors in 

judging negatively related and independent events. 
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^Evidence of such an expectation effect vas found in tha research of 
Chapman and Chapman (1967 -a & b), where subjects judged there to be a positive 

relationship between semantxcaily-assoclated clinical signs and symptoms in 

i 

stimuli that actually presented the sign and symptom as independent, or* even 
negatively related* This illusory correlation effect proved to be highly 
resistant to a variety of attempts to reduce it, including exposing subjects 
to the Stimuli several times and offering them a $20 reward for accuracy. 
Similar expectancy effects may be a reason for some past findings of differen- 
tial accuracy as a function of relationship direction* -Any attempt to examine 



the effect of relationship direction shpuld then be conducted in a context in 
i 

i^T^w 

which prior expectations are minimal* 

1 A second common finding in past research is that judgments of interevent 



correlations are biased by the relative frequencies of the event states of the 
variables involved* For example, Jenkins and Ward (1965) asked subjects how" 1 
much control their responses (pushing Button 1 or 2) had over the frequency 
with which a score*light appeared* Subjects' judgments of control were most 
, strongly correlated with the number of times the score light occurred, regard- 
less of whether that outcome was~actually influenced by their, choice of buttons* 
Allan and Jenkins (1980) found that this bias was reduced, but not eliminated 
when subjects had a ^single imtton to press or not to press, compared to Jenkins 
and Ward's two-button condition (also see Alloy ^^ramson, 1979)* The findings 
of these investigations indicate that the probability of the outcome ii a 
second possible confound to be controlled or manipulated in assessing contin- 
gency judgment* 

A final recurrent finding in past research i^that the accuracy of judging 
interevent contingency depends on how the event frequency information 'is 
presented* Two common formats present this information either as a series of 
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individual event-state combinations (e.g., Alloy & Abramson, 1979; Shaklee & 
Mims\ 1982; Ward & Jenkins, 1965) or as a summary table (e.g, , Seggie, 1975; 
Smedslund,- 1963; Ward & Jenkins, 1965). Experiments which have compared the 
two presentation formats have found accuracy to be higher when the frequency 
information is summarized in tatle formats 

Of courste, the serial and summary formats differ in a variety of ways, 
Most obvious is the added memory demand involved in the trial-by- trial presen- 
tation of information; thus, subjects who add a strong^memory load to an 
already complex judgment process may compromise accuracy to simplify an over- 

r 

whelming task. Shaklee and Mims (1982) relied upon such a memory account in 
interpreting their judgment findings, Ward and Jenkins (1965), howevgr, 
argued that^, while important , 'memory load cannot fully account for the judg- 
~~ ment difference" between setial aid summary formats. Rather, they jjyroposed 
that the serial presentation of stimulus information may lead subjects to 
organize £he information differently from those fcho view the same 'information 
in a tabled format. In support of this point. Ward and Jenkins note that 
aubjects in their experiments wjio were shown tabled information after N serial 
presentation, used less appropriate judgment strategies than those who saw only 
the tabled information. If information is organized differently jinder the two 
conditions, then this may lead subjects to make different judgments of inter- 
event relationships/ Although this reasoning is plausible, past paradigms 
have confounded presentation format with memory load; the contributions of 
memory and organization effects in past research cannot then be separated* * 
The issue is best addressed by com>aring use of. serial and summary frequency 
information in conditions alike in memory load* 

The present study thus compared serial and summary formats in a setting 
free i>f memory demands, while also using a problem for which subjects should 
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have little bias as to the naciire of the interevent relatioti. The basic 
Situation involved troubleshooting a malfunctioning radio. While this situ- 

r * 

atiori is far less dramatic than Folonius' effoxts Co determine the reason for 

• it 

Hamlet's odd behavior, it is .nonetheless representative of everyday instances 
of causal reasoning. 

Subjects were told that an individual was trying to find the cause of an 

t 

intermittent buzz (B) by occasionally tapping (T) on a wire inside the radio. 

The results of the troubleshooting were then given to the subject, who was 

asked £o judge' the degree to which tapping affected the radio's buzzing; from 

(i * 
"causes the sound to occur" to "has no effect on the sound\to "prevents the 

sound from occurring. 11 This context has the virtue of being one in which 

subjects should not have a strong expectation about the nature^f the response*, 

outcome relationship; tapping a wire should be as likely to complete as to 

J9>reak a loose connection. Similarly, if the wire is not loose, tapping i£ 

should h&Me no effect on the buzz. 

/ 

Holding constant the probability of tapping, £(T), both the probability 
of a buzz given a tap, £(B/T) , and the probability of a buzz given no tap, 
* £<B/T), were systematically varied to yield 24 different troubleshooting 
conditions. These conditions in turn constituted nine tap-buzz contingencies, 
£{B/T) - £(B/T), ranging in . 25-steps .from -1.00 to +1.00 (see Allah, 1980 for 
further distussion of various measures \f contingency or correlation).- 

An additional feature of the 24 troubleshooting conditions was that they 
were contrived in such a way that they varied not only in th£ tap-buzz contin- 
gency, but also in the overall probability per sampling interval of the 
buzzing sound, £(B). Eight different buzz probabilities were studied, ranging 
in <125-steps from .125 to 1.000. Because the tap-buzz contingency and the 
relative frequency of the radio's buzzing vs its not buzzing were independent 
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dimensions in the present experimental design, the contributions of chase 
variables to subjects' judgments of correlation could be individually assessed. 

The method of information presentation was studied with two basic techni- 
ques. In one, subjects were given summary tables showing the numbers of times 
that the four possible event sequences occurred in 24 sampling intervals: 
tap-buzz, tap-no buzz, no tap-buzz, H and no tap-no buzz. In the othet*, the 
same information was given in a time line format, with the 24 sampling inter- 
vals graphically and linearly arrayed. Such an arrangement preserves the 
sequential character of the critical events, while minimizing the strong 
memory demands that are ordinarily placed on subjects when they are given 
information in a trial-by- trial fashion. This method was originally suggested 
bf Ward and Jenkins (1965, p« 240); however, it has never been utilized in- 
experimental research. 

Since past work has not entailed a time line^ presentation of event, 
frequencies, our series of investigations began by looking at subjects 1 judg- 
ments using this format alone. Experiment 1 explored the effects of tap-buzz 
contingency and buzz probability on judgments of tap-buzz conciliation 1$ both 
withln-'subjects and between-subjects paradigms. Experiment 2 directly 1 com- 
pared the effects of the time line and summary table methods of information 
presentation. Because the second experiment disclosed that judgments did 
differ under the two conditions of information presentation, Experiments 3 and 
4 explored possible reasons for the judgment differences. 

Experiment 1 

The first experiment investigated the. judgment of response-outcome corre- 
lation when responses and outcomes were shown to subjects in a time line 
format. In jple >part* of the experiment, each subject received only 1 of 24 
possible tap^b\izz conditions; in the 6ther part, each subject received allr 24 

■*( 
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tap-buzz conditions* |oth between- and within-subjects conditions^ were included 
in order to identify: possible influence3 of multiple judgments, since we hoped 
to use the more efficient within-subjects procedure inlat^r work. Subjects' ^ 
ratings of the response^outcome relationships allowed us to determine 
degree to which the tap-buzz contingency, £(B/T) - £(B/T) t ^nd the overall 

probability of the buzzing sound, £(B) , influenced their behavior. To deter- 

# * I* 

mine whether the sign of -the response~outcom£ : /corr elation" affected subjects' 

.y 

judgments, equal numbers of positive and negative contingencies were studied. 
Method^ ' ' x / 

i 

Subjects , the subjects were participants in an introductory psychology 
class, who served in the experiment as one £pt ion for "fulfilling a course 

i * • 

requirement. A tital of 552 students served in the betwee^su6jects part of 
the experinfent anji a total of 25 students served in the witnin-subjects part* 

Problys . set of 24 problems was constructed. These proteins we*e 
alike in that they 56^1 comprised 24 sampling intervals. Each sampling inter- 
val in turn had two components : a * .sponse" component during which a tap 
might or might not occur, and an "outcome" component during which a buzz might 
or might not occur. Each of the 48 resulting components of a problem was 
denoted on the subject's problem sheet as a dash; thfe_4£ eonsScut^tfe dashes 
thus constituted the time line for each problem. Taps in the response colli- - 
ponent of a sampling interval were denoted by an "A n above the dashed time 
line, and buzzes in the outcome component of a sampling interval weije denoted 
fiy a "B" below the dashed time line. f \ 

^g^For all '24 problems, phere were 12 taps represented in the 24 possible 
response components. Thus, the probability of tapping per sampling internal, 
£(T), was always f50. Problems varied in terms of the likelihood that a buzz 

was represented in the outcome components, , and the likelihood of buzzes 
• i i 

following taps, £(B/T) , and no tapfc, £(B/T) , in the response components. 

. i 10 
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I For each of the 24 problems, Table •! shows the numbers of sampling intfcr- 

t 

vals of each of four possible types: tap-buzz, tap-no buzz, no tap-buzz, and 

no tap-no buzz. Note that the number of sampling intervals with a tap is 

equal to 12, which is the same as the number of sampling intervals without a 

* "4* * 

tap. Note &lso that the rotal number of sampling intervals equals 24. And 

note finallythat the number of sampling* intervals with a buzz varies^ from 3 

to 24. . * 



Insert Table 1 about here 



For each problem, time lines were constructed from smaile^ groupings that 
contained eight sampling intervals. The sequence of ^vent pairings was deter- 



mined randomly within each eighl -sample group. While eight "sampling groups 

* * >* 
I 

theoretically provide all the necessary information that is needed to distin- 

^ \ " v " 

guiah the 24 problems, we thought it advantageous to triple the amount olf 

input given to She subjects In hopes that their judgment^ might thereby be 

/ 0. 
improved. For example, -Problem 18 in Table 1 was represented aa follows: 

A A A A -A AA A A A^A Jl * 

B B , B B B B B B ™B B ~B " 

* Figure 1 shows a second method of depicting the 24 problems that were 
studied. Both the top and bottom portions of the figure/ locate each problem 
within the unit square defined by the two independent conditional probabilities 
£(bZt) and.£(B/T). The top portion of the figure shows the response-outcome 

T - ' ■ * ' " ' ■ 

contingency, £(B/T) - £(B/T), of each of the problems; the bottom portion 
shows the likelihood of the buzzing sound per sampl/ng interval, £(B) , for the 
same problem set* There *are nine response-outcome contingencies and eight 
proJjHBllities o£*huzz presentation represented by the 24 problems in Figure 1. 
Furthermore, these two procedural dimensions are orthogonal, as can be seen by 
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the o^^jsite slopes of the lines that connect the 24 ^problems in the top and ' 
boti^ci' pprtions of the figure. From the figure it can finally be seen that 

/* on6 possible problem was not included in the set- When £(B/T) » 0 » jy(B/T)» , 

* * ■ 

^(©) » 0; little sense could thus have been made of the task by the subjects 

(see next section for questionnaire instructions), 
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Procedure . Subjects were'' given problem sheets that each contained^ 
instructions, a time line, and a rating scale. The r instructions read as 
follows: 

. AfteiJ buying a new radio, Kim finds that it emits a brief 
burring sound every s^ often. Kim finds this buzzing sound 
1 annoying and decides/to find its cause. Removing tihe back 

of the radio, Kim. Suspects that a wire may be loose. Kim 
chooses a wire and taps on it a number of times in order to 
. see if this'has any «fect on the buzzing sound* In the 
diagram below, Kim* . tapping on the wire is shown by an t 
A above the time li:.e which moves from lef t-*to-riRht across 
the page. An occur, e ace of the brief buzzing sound is ^ 
shown by a B below he time line. 

One of the 24 different time iines thun followed. Below the time line was a 

nine-point rating scale ranging from ~4 (prevents sound from occurring) to 0 

r I 

('has no effect) to +4 (causes soutid to occur) * Subjects were asked to circle 
the number that best corresponded to their\ansver to the question, "If you , 

X 

were Kim, what would you conclude was the effect of tapping on the yire?" 

* In the between-subjects ^art of the experiment / only 1 of the %k probleto 
sheets was given to each subject. In the wi thin-subjects part of the experi- 
' ment, each subject received all 24 problem sheets, with the order of thfe 
* slle&ts randomly determin^STT^o ■ each subject. The 24 problem sheets were 
'clipped together; each packet also included the following cover sheet:. \ 
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Hie aim of this experiment is to see how people judge 
the ^lationship between their actions and the consequences 
of those* actions. In the 24 sheets that follow, the same 
basic problem is posed: What is the relation between Kim 1 ^ 
tapping on the wire of a malfunctioning radio and the 
occurrence of a brief buzzing sound that the radio occasionally 
emits. The 24 sheets differ only In the particular relationship 
between Kim's tapping and the occurrence of the sound. For 
each of the 24 sheets, please rate the degree to which Kim's 

• tapping affects the rate of the radio's buzzing, from "prevents 
the sound from occurring 1 ' to "causes the sound T to occur." As 
you go through the 24 problems, you'll soon see that the problems 
differ from one another to varying degrees. You may sometimes 
want to look back to prior problems; you may even want to change 
prior responses. This is OK. It is more important to work 

Vthrough the" problems carefully and methodically than to give 
quick and offhand reactions. Indeed, the materials are paper- 
clipped together so that you can sort through the many sheets 
and organize them any way you wish,- 



Results 



Table 2 shows the^neans and standard deviations of subjects' judgments 
for the 24 problems in both the between- and within-subjects parts of the 

' \ ■ 

experiment:. + Each of the 24 problems is located in the table by the coor- 
dinates £(B/T) - £(B/T) and j>(B). In general, subjects 1 rating scores were 
positive functions of Jbotlli £(B/T) - p(B/T) and £(B) . 



In jert Table 2 about here 



Figure 2 graphically por rays subjects 1 rating scores as separate func- 
tions Of £(b7t). - £(b/T) and ^(B) in each part of the experiment. An^ly£is of 
variance simultaneously ass.ts ed the reliability of these two sets of functions, 



Ins* rt Figure 2 about iere \ 



X 

The left panel of Figure 2 displays subjects 1 ratings as a function of 
£(B/T) ~ p,(B/T). The j sitiv* diagonal in the figure shows the responses of a 
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hypothetical judge whose responses correspond in a linear fashion to the 
actual response-outcome contingencies and ^ho also employs the full rating 
scale. In the between- and wi thin-subjects parts of the experiment, subjects 1 
judgmentsr^rere reliable linear functions of £(B/T) - £(B/T) , V(l, 528) - 
139*17, £< .001, and F(lj 24) = 74.76, £ < .001, respectively; however, the 
slopes of chose functions were clearly less than that of our. hypothetical 
linear observer. The between- and within-subjects functions also had reliable 
quadratic components, F(l, 528) - 11.28, £ » .001, and F(l, 24) = 28.07, 
£ < .001, respectively; this trend appears to be due to the negative segments 
of the functions having shallower slopes than the positive segments. Finally, 
in the vithin-subjects part oi the experiment, the contingency -rating function 
had a reliable cubic component, F(l, 24) = 10.96, £ * .003; this trend appears 
to be due to the function having an inverted £ shape. Although the overall 
form of the between-subjects function was similar, it did not have a reliable 
cubiw component. 

The right panel of Figure 2 displays subjects 1 ratings as a function of 
]>(B). In the within-subjects part of the experiment, ratings were a positive 
linear function fit £(B), F(l, 24) - 32.63, £ < -001. In the betveen-subjects 
part of the experiment, the linear trend only approached significance, F(l, 
528) » 2*90, £ » ,089. 

To assess the relative contributions of £(B/T) - jKB/T) and £(B) to sub- 
jects' judgment,^scores , the percentage of problem variance accounted for by 
these factors was determined through the cubic component each; beyond the 
cubic component, no significant variance remained for either part of the 
experiment. In the between -subjects part of the experiment, £(B/T) - £(B/T)^ 
accounted for 36,47% of the total variance and £(B) accounted for 3,21%; in 
the within-subjects part of the experiment, the corresponding scores were 
71.87% and 24,10%. 

14 
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Discussion 

Subjects 1 judgments j>f contingencies in the time line format showed 
several interesting trends that were generally comparable in the within- and 
between-subjects parts of the experiment. These results also accord well with 
past paradigms using different presentation formats. First, judgments of 
response-outcome correlation were a reliable^ function of the ^ntingfency 
between the tapping of a wire and the occurrence of a brief buzzing souftd. 
Subjects* ratings rose as the tap-buzz contingency, £(B/T) - £<B/T), increased 
from negative to positive values* Thus, subjects clearly showed^some sophis- 
tication about appropriate bases of contingincy judgment. 

The relative accuracy of subjects* judgments is, however f another issue. 
Mean judgments indicated that subjects rated noncontingent relationships close 
to zero, but ratings of several negative relationships hovered close to zero 
as well. While subject? T ere asked to rate both the degree and che sign of a 
correlation, the clearest evidence of accuracy here was the rated direction of 
the relationship* Subjects* judgments Should also have been ordered according 



it ion. wtii 



to^the strength of the correlation, yhile this was generally true, the ratings 

yielded contingency judgments that wgre poorer than ideal. Indeed, the quad^ 

/' 

ratic component of the judgment function indicates that subjects did not treat 

/ 

positive and negative relationships symmetrically; contingencies of the same 
absolute value were rated as stronger for positive than for negative tela- 
tionsWpa. The form of this difference in ratings of relationship strength 
closely resetnbles that found -n prior research by Erlick and Mills (1967). 

The second main finding uas that judgments of correlation were reliably 
influenced by the likelihood *>f the buzzing sound t j>(B) . This bias is com- 
parable to that found in other studies in which the judgment of contingency 
depended on the likelihood ttut the .outcome occurred (Allan & Jenkins, 1980; 
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Alloy & Abramson, 1979; Jenkins & Ward, 1965K These prior studied most con- 
vincingly demonstrated a bias effect of j>(b) with response-outcome contin- 
gencies of zero; Allan and Jenkins* (19S0) investigation further suggested 
that the bias effect could arise under positive contingencies. The present i 
report confirms the above trends and also shows that the effect of ]>(B) on 
judgments holds under negative response-outcome contingencies as well (see 
that ratings tend to increase from top to bottom within most columns of Table 
2)< ■ 

Experiment «! v 

The Results of the time line portrayals in „^periment 1 were comparable 
in many ways to tho3e of past paradigms. However, subjects who view informa- 
tion in a particular format may treat the information in a manner specific to 
that format; thsc is, subjects' attention to information may depend on the way 
the information is presented* The organization or integration of attended 
information n&y vary with stimulus format as well. We propose three ways in 
which the time line and the wore familiar summary table format may produce 
different judgments* 

^irst, tabled presentation of event frequency information offers the ' 
subjects tallies of the frequencies of each type of event-state combination* 
Our time line presentation (like past Serial presentation techniques) requires 
the subjects to generate such taHies on thair own* Subjects ^iven time line, 
information may guess rather than count those frequencies, resulting in esti- 
mation errors. This logic suggests that Judgments with time line presentation 
will be generally less accural than judgments w/Lth tabled presentation and 
that such differential accuracy will be relatively constant across positive^ 
negative, and noncontxngent relationships* The resultant judgment function 
chould^ be relatively flat across all contingencies compared to that of tabled 
information. j 
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A second possible source of diffeience is the fact that the summary table 
presents the event-state combinations in a form of comparable salience. In 
contrast, each type of event pairing has a unique representation in the time 
line format (i*e., AB, A-, ~B, — ). As a result, some types of event palings 
may be more salient than others. In particular, the interval pairs with two 
event absences ( — ) may be less prominent than those with one or both events 
present. This feature may also have been true of past serial presentation 
paradigms. If so, subjects should underestimate the frequency of no tap-no 
buzz pairings. Since the denominator of the conditional probability, £(B/t), 
would then be smaller than woqld be accurate, this would result in an estimate 
of £(B/T) that is too high. This in turn should result in a bias to judge 
continr acies as being more~negative in the time line format than the same 
contingencies presented in the tabled format. 

Finally, the time line format allows the subject to determine the delay 
between tap and buzz that will be counted as a tap-buzz pairing* Consider the 
interval series A — B. The tabled format would represent this as one occurrence 
of tap-^no buzz and one of no tap-buzz. However, a subject given the time line 
presentation may well consider this series to be a single pairing of tap-rbuzz, 
This tendency would lead to an underestimation of the frequencies of event 
pairings tap-no buzz a;- no tap-buzz and ai> overestimation of the frequency of 
tap-buzz pairing's. These errors would yield an inflated numerator for £(B/T) 
and a smaller than accurate numerator for £(B/T) « These biases should 
result in judgments of contingencies being more positive in the time line than 
in the summary table format. This problem of event segmenting should not have 
been true of past discrete trial presentations, where each slide or card 
defined p** event-outcome pairing. However, the problem may be true of event 
processing in real time, when event con t in u a must be defined as discrete 



events . 
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Thus, each of three reasons for judgment differences in The two informa- 
tion presentation conditions would result in a unique pattern of judgment 
out comes * Whether any of chese differences wlLl materialize is an empirical 
question* Experiment 2 addressed this issue by comparing judgments under the 
time line format employed in Experiment 1 with jur#aents of the same problems 
presented in the summary table format used in past investigations (e.g., 
Smedslund, 1963; Ward & Jenkins, 1965). Since judgments were so comparable in 
the between- and vithin-subjects partp of Experiment 1, subjects i,n Experiment 
2 judged all 24 problems. ' r ^ 

Method 

Subjects * The subjects were 34 undergraduate research participants. 

Problems . The same 24 problems were used here as in Experiment 1* 
Problems in the time line format were typed on a single sheet of paper with 
the nine-point rating scale to the right of each problem. Problems in the 
summary table format were typed on Another shefet of paper similar to Table 1, 
except that the four types of sampling intervals were vertically arrayed; 
identical rating scales were located beneath each problem. Problems were 
presented in a single random sequence for the time line format and in a 
different random sequence for the table format. 

Procedure . During the first portion of the experimental session, sub- 
jects were given an instruction sheet describing the troubleshooting problems 
on the attached sheet of paper. For half of the subjects the problems were in 
the time line format, and for the other half the problems were in the summary 
table format. During the second halt of the session, subjects worked problems 
iu the format not worked in the first half. Instructions for time line prob- 
lems >rere the same as those used in Experiment 1. Instructions for summary 

table problems were the same, with appropriate adjustments to introduce the 

i * 
table rathe r *.han the time line format. 
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Results * 

Table 3 shows the means and standard deviations of subjects*! judgments 
for the 24 problems given in the tim*» line and summary table formats- Because 
analysis of variance failed to disclose any reliable effects attributable to 

Si 

the order of format presentation, this factor is not considered in Table 3 nor 
in lat^r data analysis. As in Experiment 1, subjects 1 ratings were positive 
functions of both £(B/T) - £(B/t) and £(B). 

Insert Table 3 about here 1 

t 

\ 

\. 

Figure 3 graphically depicts subjects' rating scores as separate func- 
tions of £(B/T> - £(B/T) and £(B) f or each method of information presentation* 
Analysis of variance simultaneously compared these two s*ts of functions. 



Insert Figure 3 about here 

The left panel of Figure 3 portrays subjects' ratings as a function of 
£(B/T) - £(B/T). Overall/ ratings were reliable linear, F(l, 32) * 51.72, 
£ < ,001* and quadratic, F(l, 32) » 12.90, £ « .001, functions of tap-buzz 

contingency. Additionally, \there was a reliable Quadratic contingency by S 

\ 

format interaction, F(l, 32) \» 4.97, p » ,033, To pinpoint the source of this 

\ 

interaction, separate analyses of variance were conducted on the time line and 
aummary table data. For both the time line and the summary table formats, 
ratings were reliable linear functions of contingency* JF(1, 33) ** 36.77, 
£ < .001, and F(l, 33) » 44.27, £ < .001, respectively. However, the quad* 
ratic trend was reliable for the time line format only, F(l, 33) * 14.59, £ * 
,001. Thus, aubjeccs' judgments were reliable linear functions of response^ 

IS 
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outcome contingency with both methods of information presentation; however, 

the method of information presentation influenced those functions , with the 

^tabled format supporting judgments that better approximated those of an ideal 

observer, particularly in the region of negative contingencies, 

The right panel of Figure 3 illustrates subjects' ratings as a function 

of £(B). Overall, ratings were reliable linear, F(l, 32) » 30.11, £ < .001, 

and quadratic, F(l, 32) - 26.68, £< .001, functions of outcome probability. 

Additionally, there were reliable linear, F(l, -32) - 6.32, £ - .017, and 

quadratic, F(l, 32) - 12. 99, £< .001, outcome probability by format inter- 
* 

actions. Because of these interactions, follow-up analyses were separately 
performed on the time line and summary table data. For the time line data, 
ratings were reliable linear, F(l, 33) = 34-37, £ < .001, and quadratic, F(l, 
33) - 30.43, £ < .001, functions of £(B) ; for the summary table data, the 
linear trend was reliable, F(l, 33) = 5.33, £ - .027, and the quadratic trend 
fell just short of statistical significance, F(l, 33) = 3,69, £ - .063, Thus, 
the -method of information pre ntation altered the influence of outcome proba- 
bility on subjects' ratings; providing the information in a time line format 
both steepened the probability- judgment function and increased its curvature 
relative to providing the same information in a summary table format. 

And, regardless of tap-buzz contingency and buzz probability, judgments 
were reliably higher in the time line condition, than in the suknary table 
condition, F(l, 32) » 5.03, £ - .032, 

To assess the relative contributions of response-outcome contingency and 
outcome probability to subjects' ratings, the percentage of problem variance 
accounted for by each factor was determined as in Experiment 1. For the 
summary table data,' £(B/T> - £(b/t) accounted for 81,35% of tfie total variance 
and £(B) accounted for 12.58%; for the time line data, the corresponding 
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scores were 39.48% and 51.79%, Beyord the cubic component* no significant 
variance remained for the summary tabl^ data. For the time line data, the 
8,78% remaining variance was small, but statistically significant, £(17, 361) 
- 3-23, £ < -001. ^ 
Discussion 

The data from subjects given ttis time line in this experiment replicate 
the judgment patterns of subjects in the comparable condition of Experiment 1. 
In addition* th* results of Experiment 1 2 confirm prior findings (Shaklee & 
Mims, 1982; Smedslund, 1963; Ward & Jenkins, 1965) that the method of infor- 
mation presentation affects subjects' judgments of response-outcome correla- 
tion. 

The obtained judgment differences under two conditions comparable in 
memory demands suggest that past effects of presentation conditions may not be 
solely attributed to memory. In general, subjects* judgments were more 
closely attuned to response -out come contingency when Information was given in 
the summary table than when the same information was given in the time line. 
First, the contingency-judgment function (left panel of Figure 3) was mure 
symmetrical about zero in the Summary table condition, suggesting that sub- 
jects rated positive and negative relationships in a comparable fashion. 
Again, the time line portrayal supported less accurate judgments of negative 
than positive contingencies. Second, table format judgments were less dis- 
torted by the probability of the buzzing sound (right panel of Figure 3). The 
linear contingency by format interaction showed that the time line judgments 
were -steeper functions of £(B) than the summary table judgments. 

We previously reviewed three reasons why time line and surjnary table 
formats may result in different contingency judgments. The suggestion that 
the time line will lead to more errors in estimating frequencies of event 
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pairings than the summary table predicted overall poorer contingency judgment 
accuracy (I.e. , a flatter, but symmetrical contingency-judgment function) in 
the time line than in the tabled format condition. The possibility that joint 
event absences (no tap-no buzz) were less salient in the time line than in the 
, tabled presentation 'mode predicted a general bias to report relationships as 
more negative in Ihe time line than in the summary table format. However, 
neither of these difference patterns describe our results. 

Subjects in this experiment did show a tendency to judge relationships as 

more positive in the time line than in the summary table condition. This 

. ... 

result supports our third proposed source of differences, that subjects may 
group event pairings differently in the time line than the tabled format. In 
particular, event tesies A— B could be identified as a single tap-buzz occurrence 
rather than a tap-no buzz and a no tap-buzz, yielding in a bias to report . 
relationships as positive. However, we should note that while ratings were 
generally higher in the time line than in the summary table condition, the 
positivity bias was more pronounced for negative than positive contingencies. 
One possible account for this finding involves the influence of context on the 
grouping of event pairings; that is, A— B may be most likely to be judged a 
tap-buzz occurrence when there are few contiguous AB pairings in the time 
line, as would be the case in negative contingencies. 

Besides helping us to understand why different presentation formats sup- 
port different judgments, these performance differences between groups also 
allow us to reject the possibility that time line subjects' problems with 
rating negative contingencies are due to a response bias or to prior expecta- 
tions. Any expectation about the effect of tapping on the radio's buzzing 
should be the same in the two groups, but judgments of negative contingencies 
were distorted fcrtf^time line subjects only. Similarly, since subjects made 
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judgments on the same rating scale In the tvo conditions* performance dif- 
ferences cannot be attributed to peculiarities in the scale Itself, 

Experiment 3 

The results thus far suggest chat subjects may define events differently 
In the time line and table formats. If this is the principal reason for the 
inaccurate responses of time line subjects, then their judgments should 
improve when the continuous stream of events In the time line la separated 
Into discrete units* 

Our third experiment further explored 'the problem of defining Individual 
sampling periods by placing a clear break between paired .Intervals In the time 
line format* To do this, we simply, added a blank space between successive 
sampling intervals along the time line. As In the wl thin-subjects part of 
Experiment 1, subjects rated all 24 tap-buzz contingencies* These judgments 
were compared to those obtained In Experiment 1,. in which successive sampling 
intervals Immediately followed one another; ** , 

Method 

Subjects , Another group of 25 undergraduate research participants joined 
the 25 nho had served in the wlthln-subjects part of Experiment 1, and whose 
data are depicted agaln^ln the Results section chat follows* Subjects In 
these two groups vere from the same Introductory psychology course and were 
tested within 3 weeks of che same school term* ' 

Problems * The problems for the new subjects were Identical to those in „ 
Experiment 1, except that one blan 1 space was inserted between successive 
sampling Intervals^along the time line. This format is Illustrated 4 in a 

sample Item (Problem 11) : 

A_ „ A ^ A_ A_ ^ ^ A A A_ A _ A_ A A_ A^ 

B B "§ B ~*i B B 1 E ~B B B B B ~B 
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Procedure . The procedure for the new subjects given the broken time 
lines was identical to that for the former subjects given the unbroken tiflfe 
lines in Experiment 1. ^ * 

Results . 

Table 4 shows the means and standard deviations of subjects' judgments 
for the 24 problems given in the broken and the unbroken time line conditions 
of Experiment 3. , Again, subjects 1 ratings were positive functions of £(B/T) - 
£(B/f) and .£(B). 



Insert /Table A about here 

Figure 4 graphically illustrates subjects 1 rating scores as separate 
functions of £(B/T) - £(B/T) and £(B) for each time line condition. Analysis 
of variance simultaneously compared these two sets of functions. 



Insert Figure 4 about here 



The left panel of Figure 4 shows, subjects 1 ratings as a function of 
£(B/T) - £(B/T). Overall, ratings were reliable linear, F(l, 576) - 542.75, 
£ ^ .001, quadratic, Hl t 576) - 34.32, £ < .001, and cubic, F(I, 576? » 
20.35, £ < .001, functions of tap-buzz contingency. Additionally , there was a 

.reliable linear contingency by time line interaction, F(l, ^76) ■ 5.08, £ * 
.025, and a near significant quadratic contingency by time line interaction/ 
F(l, 576) * 3.18, £- .075. Therefore, separate analyses of variance were 

^conducted on the data for the group given the broken time line and for the 
group given the unbroken time line. For both the broTsen and unbroken time 
lint groups, ratings were reliable linear functions, F(l, 24) - 83.74, £ < .001, 
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and 11(1, 24) » 74,76, £ < .001, respectively; quadratic functions, F(l, 24) - 
7,17, £» .013, and F(l, 24) «■ 28.07, £ < .001, respectively^; and cubic func- 
tions of contingen^, £(1, 24) * 24.83, £ < .001, and F(l, 24) * 10.96, £ » 
.003, respectively. Thus, although the contingency-rating functions were 
similar, judgments of ^contingency were more strongly differentiated for sub- 
jects in the broken time line group; this greater differentiation was generally 
more notable for negative than forvpositive contingencies. 

The right £anel of Figure 4 portrays subjects' ratings as a function of 
£(B), Overall, ratings were reliable linear, F(l, 576) * 139.87', £ < .001, 
and quadratic, £(1, 576) » 25 .33 , £ «-.001, functions of outcome probability. 
Additionally, there was a reliable quadratic outcSme^ probability by tilne line 
^interaction, F(l, 576) » 6 t 18* £ » .(U3. Separate analyses oi variance were 
therefore conducted on the data from the two time line groups. For both the 

r 

group given the broken time line and the grotip given the unbroken time line, 
ratings were reliable linear factions of £<B), £(1, 24) t - 20.62, £ < ,001, 
and £(1, 24) « 32.63', £< .001, respectively. However, the^uadratic trend 
was reliable for the broken time line group only, £(1, # 24) - 24.01, £ < .001. 
Thus, the probability-rating functions of the two time line groups were 
similarly sloped, ^although the function for the broken time line appeared to 
turn downward at high outcome probabilities,, more than the function for the 
unbroken time line. * 

To assess the relative contributions of response-outcome contingency and 
outcome probability to subjects* judgments, the percentage of variance accounted 
fox; by each factor was determined as in Experiments- 1 and 2* For the broken 
time line group, j^CB/T) ^£(B/T) accounted fo^ 77.31% of the total problem 

variance^Snd £(B) accounted for 19.082; for the unbroken time line group, the 

\? 

corresponding scores were 71.372r*and 24.10%. 
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Discussion 



We introduced the broken time line format in Experiment 3' to partition 
the time line continuum into discrete sampling intervals. The results of the 
experiment indicate th|t this manipulation had an effect on judgments of the 
problem set. Subjects judging broken time lines showed gxeater differenti- 
ation in their ratings as a function of the scheduled contingency thaft sub-' 

m 

t f 

jects judging unbroken time l^nes. This increased differentiation was generally, 
more prominent for negative, than for positive relationships, a difference ^ 
which was also true of subjects judging tabled information in Experiment 2. 

Thus, the results of subjects who viewed the broken time lines dqpli^ate 1 
in some respects the behavior of subjects judging on the basis of tabled 

information. Our ability to increase the accuracy of contingency judgments 

1 

by this manipulation enhances confidence in our interpretation that subjects 
made errors in identifying discrete event pairings in the continuous time 
lines. The similarity of judgments of tabled and broken time line information 
suggests that one function of the table may be to separate a stream of events 

™„ , _^ ___„ „ _^ _ rding to the 

type of event pairing and thus may be more accurately incorporated into £ 
contingency judgment.* 

While breaking the flow cf the time line into discrete sampling intervals 
yielded judgments more similar to those ^dade with summary table presentation, 
inspection of Figures 3 and 4 shows that the judgments obtained under these 
two conditions were not identical. Contingency-judgment functions under the 
broken time line format were less symmetrical about zero than undej: the 
summary table format, and probability-rating functions were steeper In the 
former condition than in the latter. Thus, other factors may well, contribute 
to the differences in contingency judgments obtained with the time line and 
summary tabl,e formats in Experiment 2. ' 
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Experiment 4 

Thus far, our leading interpretation of the problems created by*a con- 
tinuous representation of events Is that people have difficul v breaking the 
stream Into discrete units. An alternative approach to testing Jthis account 
might be to teach people to parse the time line into the component units. If 
such training produdfes judgment functions like those found in our broken time 
line and table formats, such findings would further support this as the source 
of judgment differences. A second luncttton of the table mentioned earlier 
might be to jffer subjects numerical summaries of the information about the 

t 

four event combinations. This summarized information may be more readily 
incorporated 4nto a decision rule in judging event covariations. In this way, 
judgment accuracy might be further enhanced if subjects were asked to count 
the occurrences of each event-state combination and note these frequencies 
in a table. By this process, subjects would effectively convert a time line 
into a table format. 

Our fourth and final experiment used each of these approaches. One group 
of subjects was presented with the 24 problems in our original time line 
format, but were taught to break the line into response-outcome intervals 
(line** interval) . A second group received these instructions and were also 
asked to count the frequencies of each event-state pairing and write those 
frequencies in a table (line-table). Time, line and table groups using our 
original instructions served a- comparison conditions for these manipulations. 
Improved judgment by line-interval subjects compare J to time line subjects 
would further implicate line segmenting as a factor in contingency judgment;. 
Further improvements by line-table subjects would suggest that summary infor- 
mation is also an important function of the tabbed fortnat. Because we found 
sex differences In contingency judgment in related work of ours (Shaklee & 
Hall, in press), sex was included as a factor in chis experiment. 

27 
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Method 

Subjects . A total of 160 introductory psychology subjects served in the 
experiment with 20 males and 20 females in 'each of four judgment conditions. 

Problems . The 24 contingencies for this experiment were the same as 
those in the previous experiments. Format of problems in the time line and 

table representations was the same £s that used in Experinents 1 and, 2. 

v 

Procedure . The* introduction to the troubleshooting problems was identi- 
cal to that used in the previous studies* except that the problem raPresenta- 
tion was explained in one of four ways: 

Line: These instructions were the same as those used in Experiments 1 
and 2« 

Line*Interval: These problems were represented in a time line like that 

used in Experinents 1 and 2, but in this case subjects were specifically 

instructed how to break the time line into response-outcome intervals. In* 

structions were as follows: 

Each dash on the time line represents one unit of time. 
Time units come in pairs, with the first an opportunity 
for a response (Tap or No Tap) and the second an opportunity 
for an outcome (Buzz or No Buzz). Thus* pairs of successive 
intervals can be of four types: Tap-Buzz, Tap-No Buzs, No 
Tap-Buzz, No Tap-No Buzz. For each of the time lines, please 
rate the degree to which Kim's tapping affects the rate of 
the radio's buzzing, from "prevents the sound from occurring" 
to "causes the sound to occur." 

♦ 

Line-Table: Problems and instructions were identical to^thosfe in the 

r 

Line-Interval condition, except that, each problem was accompanied by a bjjank 

table labeled as in the previous table condition of Experiment 2. Subjects 

were instructed to complete the table before making their judgment. Instruct 

tions were as follows: 
# 

Each dash on the time line represents^ one unit of time. 
Time units come. in pairs, with the first an opportunity lor 
a response (Tap or No Tap) and the second an opportunity for 
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am outcome (Buzz or 



att^ out come (Buzz or No Buzz) « Thus t Pairs of successive 
intervals can be of four types: Tap-Buzz, Tap-No Buzz, No 
Tap-Buzz, No Tap-No Buzj. For each time line, please count 
the frequency of each of these four types of interval pairs, 
Entex; those frequencies in the table to ttti right of the time 
line. Once you have completed the table, please rate the 
degree to which Kim's tapping affects the rate of the radio's 
buzzing, from prevents the sound from occurring" to "causes 
the sound to occur." 

Table: Problems and instructions in th*s condition were identical to 
those in Experiment 2. 

In each condition, the information offered in the instructions tf^s shown 
on a sonrple problem illustrating each type of response-outcome pairing. 
Subjects were invited to ask any questions they might have, after which they 
proceeded at their own pace through the problem set. 

Results 

Means and standard deviations of subjects 1 judgments for the 24 problems 
in each judgment condition are shown in Table 5. Figure 5 illustrates sub- 
jects 1 judgments of the nine contingencies, £(B/T) ~ £(e/T), and the eight 
probabilities of buzzing sound, £(B), for the four judgment conditions. These 
functions were simultaneously compared by analysis of variance, including sex 
of subject and judgment condition as factors. Paired follow-up analyses were 
conducted on interactions, setting alpha at .025 to reduce the experiment^wide 



error rate. 



Insert Table 5 aad Figure 5 about here 
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The overall analysis yielded reliable linear, F(l, 152) - 851,86, £< .001, 
quadratic F (1 , 152) » 100,92, £ < ,001, and cubic F(l, 152) - 12,52, £ < .001 
trends of response-outcome contingency on subjects* judgments. As ii: our 
previous experiments, judgments were a function of problem contingency, but 
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with judgments of negative relations closer Co zero than chose of positive 
relations* This analysis alio shoved a main efface of judgment condition, 
F(3, 152) » 11*40, £< *001, although that effect is qualified by a contin- 
gency by condition interaction, F(23, 3496) - 2*47, £< *001* As seen in the 
left portion of Figure 5> the form of this interaction shows that judgments in 
the Table condition were most symmetrical about zero, judgments in the Line 
condition were least svmmetrical* and judgments in the Line-Interval am* Line- 
Table conditions fell between these two extremes* Follow-up analyses .compared 
contingency judgment functions for selected condition pairs* Line-Interval 
and Line conditions were compared to identify the effect of the interval 
segmenting instructions* This analysis showed Line-Interval subjects to be 
significantly different from Line subjects; linear trend F<1, 76) » 11*12, £ 3 
*001, the quadratic trend approaching significance F(li 76) * 4*92, £ * *029* 
Comparison of Line-Table and Line-Interval contingency functions shoved that 
tabling the frequency information had no additional effect on judgment accuracy* 
Line-Table and Table jr "ges were compared to see if judges who tabled the 
frequency information for themselves were equivalent in judgment to those who 
judged tables provided by the experimenter. This comparison showed that 
contingency judgment functions were n<3t equivalent £ 0 r the two groups, with 
Line-Table and Table judges reliably different in quadratic trend, £(1, 76) » 
5*83, £-**018, but not in linear or cubic trends* 

Sex differences in contingency functions were statistically significant, 
with the contingency- judgment function for females flatter than that for 
males; linear trend F(l, 152) * 3*94, *049, cubic trend F(l, 152) » 4.38, 
£ m *038* This aex eifect did not interact significantly pith judgment condition* 

As in our previous experiments, subjects 1 judgments were an increasing 
function of the probability of the buzzing sound (see right portion o£ figure 
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5), Racings showed significant linear, F(i, 152) » 210, 66, £ < ,001, quad- 
ratic, F(l, 152) - 80,90, £< ,001, and cubic, F(l, 152) - 4.58, £ » .034, 
trends as a function of p(B), Unlike previous analyses, however, these 
probability-judgment functions were noc reliably affected by judgment condi- 
tion, although the Line group again showed the greatest effect of jj(B) and the 
Table group showed the least effect, Effects of £(B) alsP did not differ as a 
function of subjects' sex. 

The relative contributions of response-outcome contingency and outcome 
probability in each of the four conditions were determined as in the prior 
experiments, For the Table group, £(B/T) - £(B/T) accounted for 89,07% of the 
total problem variance and £<B) accounted for 9.47%; for the Line-Table group, 
the corresponding scores were 80,97% and 17.02%; for the Line-Interval group, 
the scores were 76,04% and 17,61%; and f^fc the Line group, the scores were 
71.38% and 22.64%, In only the lacter two groups was the residual variance 
significant: Line-Interval residual - 6,35%, F(17, 646) « 6,72, £ < ,001, and 
Line residual » 5,98%, F(17, 646) » 2.25, £* ,003. 

Since frequency judgment errors may detract from contingency judgment 
accuracy, the frequency tables generated by subjects in the Line-Table condi- 
tion were examined for accuracy. Overall, errors were small, with mean 
absolute deviations of ,15, ,10, ,30, and 1,65 for Tap-Buzz, Tap-No Buzz, No 
Tap-Buzz, and No Tap-No Buzz frequencies, respectively. In view of the dif- 
ferential judgments of positive and negative relationships in this condition, 
frequency judgment accuracy was compared for problems representing positive 
aiid negative contingencies, Absolute deviations were averaged across table 
cells for this analysis. A matched-pairs £-test showed no reliable differences 
In frequency judgment errors on positive and negative contingencies , £(39) < 1. 

Discussion 

Experiment 4 represents a* conceptual replication of our third experiment. 
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In Experiment 3, we broke the time line into discrete units. In this experi- 
ment, we taught the subjects themselves to define these intervals. The results 
indicate that the manipulations in the two experiments had similar effects. 
Line-Interval and Line-Table subjects in Experiment 4 produced contingency- 
judgment functions intermediate to those of our Line and Table subjects. 
Line-Interval and Line-Table subjects 1 contingency-judgment functions were 
more symmetrical about zero than that of Line subjects, although the two new 

conditions did not differ from each other. This failure to find additional 

> 

improvement by subjects who completed a frequency tsble indicates that the 
availability of summary information contributes little to judgment accuracy. 
However, the similarity of these two functions to that of subjects in our past 
broken time line condition enhances our confidence in the problem of event 
segmenting as a source of error in judging negative relationships. 

The finding that Line^Table judges are also less accurate than Table 
judges is a bit of a surprise. These subjects have effectively converted time 
line information into a tabled fdiroat. However* the accuracy of that conversion 
is a second question. Since any devi^tidns in frequency judgments must 
necessarily be in the direction of lower accuracy, subjects in this condition 
may have somewhat erroneous information on Which to base their judgments. 
However, a look at subjects* frequency counts indicates reasonable accuracy; 
indeed, 12 out of AO subjects did not show a single ervor on any of the 24 
problems. In addition, error rates were similar on negative and positive 
^contingency problems. 'Thus, inaccuracy of "frequency judgments constitutes a 
weak account of the difference in judgment functions of Line-Table and Table 
subjects. 

These differences between Line-Table and Table judgments replicate the 
stimulus presentation effects of Ward and Jenkins (1965) in a substantially 
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different format . Their subjects viewed sequences of a vent-out come pairs 
(cloud seeding or not/rain or no rain), each sequence indicating some degree 



of positive relationship* When the sequence was complete, one group of sub- 
jects saw a table summarizing the frequencies of each of the event-state 
combinations. A second group saw the tabled information only. Ward and 
Jenkins found -that subjects who saw the tabled information after the event 
series v*ere less accurate in their judgments than those who saw the tabled 
information alone. It was this finding that inspired the experimenters to 
conclude that viewing the event sequence had caused the subjects to represent 
the information in a way that . the table failed to counteract* perhaps dif- 
ferentially emphasizing the relative importance of particular event-state 
pairings. Our own results parallel these past findings closely. In our case, 
however, subjects viewed event contingencies in a linear representation free 
of memory demands. / 



As in our previous experiments, subjects* judgments here were biased by 
the probability of the burning sound. Howevar, unlike Experiments 2 and 3, 



ment conditions. The failure to replicate this finding is surprising and 
difficult co account for given the comparability of other aspects of the 
present results to our other previous outcomes. This finding does temper our 
confidence in the previous result that judgments of tabled information are 
relatively free of the effect of the probability of outcome, 

Finally, this experiment showed a reliable effect of sex, with contin- 
gency-judgment functions of females reliably flatter than those of malp?. 
This difference may indicate that females have a higher judgment error rate 
than males, contributing to flatter functions. This interpretation is con- 



gruent with findings in our related work (Shaklee & Hall, in press) showing 



t 




the extent of that bias was not reliably different in the Line and Table judg- 
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that Temales~use simpler , less accurate rules thlin^tnose used by maTes^to 
judge event covariations* An alternative interpretation of the sex differences 
in che present experiment is that- the two sexes judge the problems with similar 
accuracy, but that the females use a more limited range of the scale to make 
their judgments. However* a comparison of judgments indicates that the two 
sexes use the scale extremes (+4) at comparable rates (11«3Z and 12;. 2% of 
judgments for males and females, respectively), ruling out response conser^ * 
vatism as a viable account of this sex difference. 

Concluding Conusant s 
In overview, the results of four different experiments suggest that 
judgments of interevent contingency importantly depend on the method of 
presenting information about event pairings. Most accurate were judgments of 
summary table information (Experiments 2 and 4); least accurate were judgments 
of information presented in a continuous time line format (Experiments 1, 2, 
and 4). The accuracy of subjects judging partitioned time lines (Experiment 
3) fell in between that of the other two conditions. Subjects 'trained to 
segment continuous time lines (Experiment 4) made judgments similar to those 
who saw partitioned tine lines. < This evidence suggests that Ward and Jenkins 
(1965) were correct in their suspicion that presentation format may influence 
subjects 1 treatment of frequency information in making contingency judgments. 
Our evidence indicates that subjects may break event sequences into different 
discrete event pairings depending on the format in which the frequency infor- 
mation is presented. This explanation accounts well for our own findings, but 
may not be similarly useful in explaining the effects of* relationship direction 
in some past paradigms. As noted earlier, slide or card sequence presentations 

1 s 

offer event pairings as discrete units rather than as event continua* 

r 

This interpretation offers a ready account for the finding in past 
research that subjects judge negative relationships less accurately than 



\ 
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positive relationships. Past researchers have suggested t^t subjects know 

how to judge positive, but not negative contingencies* Allan (1980) , however, 

pointed out one difficulty with this interpretation; subjects who only know 

how to judge positive relationships must be able to distinguish between posi- 

tive and negative relationships in order to apply the appropriate rule to 

i 

positive contingencies. Presumably, a different, less accurate rule is 
applied to negative (and independent) relationships. Thus, this interpreta- 
tion require** that an individual maintain more than one rule to judge event 
contingencies, and that the person know, when to apply which rule to which 
relationship. ^ 

Our analysis indicates a single judgment problem which would result in 
differential accuracy on positive and negative relationships: that is, sub~ ^ 
jects' boundaries for event segments depend on the other events in the stream. 
Positive relationships are typif ied by many response-outcome pairs which would ^ 
define a brief time interval as a response --out come unit. However, where few 
outcomes promptly follow responses, the observer may accept relatively delayed 
outcomes as "caused" by the response. The estimate of response-outcome pairs 

is inflated, resulting in an illusion of a relationship which is less negative 

#* 

*b$ian is objectively the case. 

t * 

We would argue that the problems our subjects encountered in the time 
line format could be similar to those encountered in judgments of real world 
contingencies — response-outcome delays may vary in everyday experience. One 
task of the perceiver is then to define which sequences represent true response- 
outcome pairings. Investigations of the cues used to break event sequences 
into discrete units are rare. Our evidence suggests that understanding this 
process may be important to our ability to account tor contingency judgments. 
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Table 1 , ; 4 

Frequencies of Response-Outcome Possibilities 

in Each Experimental Problem > 
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Table 2 



Means and Standard Deviations (In Parentheses) of Subjects* Ratings 
In the Between- and Wftthln-Subjects Parts of Experiment 1 
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Table 3 

Means and Standard Deviations (in Parentheses) of Subjects 1 Ratings 
Under the Time Line and Summary Table Formats of t Experiment 2 





"1 






P 


(B/T) - p(B/T) 








p(l») 


-1.00 


-0.75 


-0.50' 


-0.25 


0.00 


+0.25 


+0.50 


+0.75 


+1.00 










Time 


Line 










.125 








-2.38 




-2.09 
















(2.06) 




(2.11) 








.250 






-1.09 




-1.15 




0.56 












(2.05) 




(2.20) 




(2.19> 






.375 




-1.32 




-0.62 




0.94 




1.41 








(1.81) 




(1.78) 




(1.24) 




(1.97) 




.500 


-0.94 




-O.iv 








1.29 




2.47 




(2.U) 




(1.38) 




(1.75) 




(1.74) 




(2.29) 


625 




0 62 




0.32 




1.29 




1.85 








(1.85) 




(1.34) 




(1.15) 




(1 .80) 




.750 






0.71 




0.85 




1.85 












(1.72) 




(1.54) 




(1.77) 






.875 








1.76 




1.62 
















(2.04) 




(2.00) 








1.000 










0.79 








. / 












(2.26) 










Summary Tabl.e 


.125 








■ -1.41 




-0.21 
















(2.18) 




(2.18) 








.250 






-1.09 




-0.38 




O.tt 












(2.72) 




(1.91) 


6.29 


(2.36) 






.375 




-1 .03 




-1.03 






1.26 








(2.55) 




(2.02) 




(1.49) 




(1.82) 




.500 






-0.74 




0.24 




1:15 




2.44' 




(2.17) 




(1.87) 




(i.06) 




(1.65) 




(J. 87) 


. 625 




-1 .68 




-0,06 




1.03 




1.24 








(1.74) 




(1.55) 




(1.54) 




(2.38) 




.750 






-0.29 




0.50 




1.62 


j 










(2.01) 




(3 .54) 




(1.97) 






.875 








0.38 




0.91 









(2.26) (2.12) 
1.000 0.50 -~ 

(1.74) 43 



Table * 



Means and Standard Deviations (in Parentheses) of Subjects' Ratings 
Under the Broken and Unbroken Time Line Conditions of Experiment 3 



p(B/T) - p(B/T) 
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Means and Standard Deviations (in Parentheses) of Subjects' Ratings 
in the Pour Conditions of Experiment 4 

p(B/T) '- p(B/f) 
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Figure Captions 

Figure 1, The 24 different response-outcome prjbleras on the coordinates 
p(B/T) and p(B/T). The top portion locates the nine different response- 
outcome contingen ies, p(B/T) - p(B/T)» on the unit square; the bottom portion 
locates the eight different outcome probabilities, p(B), See text for addi- 
tional explanation. 

Figure 2. Contingency-judgment functions (left) and probability-judgmett\ 

V 

functions (right) in the within- and be*— ^l-subjects parts of Experiment 1. 

Figure 3. Contingency-judgment functions (left) and probability-judgment 
functions (right) under the time line and summary table formats of Experiment 
2, 

Figure 4. Contingency-judgment factions (left) and probability-judgment 
functions (right) under the broken and unbroken time line conditions of Experi- 
ment 3. 

Figure 5. Contingency-judgment functions (left) and probability-judgment 
functions (right) under the four experimental conditions of Experiment 4. 
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